Cleansing

EDA

Starting with we need to treat missing values for the columns 'work_interfere' and 'self_employed'

A 99% of the people who didn't answet the question 'Whether mental health interfers with work' don't seek help. If we think null values as a category, 'NEVER' of all categories of column_work_interfere.

As there are no missing values we proceed to the next step.

All columns are categorical, except age.

Cardinal Information

Data Preprocessing part ends here. Further, we move to Model Building.

Model Building and evaluation

Logistic Regression

Among the models, we narrow down the most promising models to fine tune the parameters further